Reproductives signature revealed by protein profiling and behavioral bioassays in termite

Proteins are known to be social interaction signals in many species in the animal kingdom. Common mediators in mammals and aquatic species, they have seldom been identified as such in insects' behaviors. Yet, they could represent an important component to support social signals in social insects, as the numerous physical contacts between individuals would tend to favor the use of contact compounds in their interactions. However, their role in social interactions is largely unexplored: are they rare or simply underestimated? In this preliminary study, we show that, in the termite Reticulitermes flavipes, polar extracts from reproductives trigger body-shaking of workers (a vibratory behavior involved in reproductives recognition) while extracts from workers do not. Molecular profiling of these cuticular extracts using MALDI-TOF mass spectrometry reveals higher protein diversity in reproductives than in workers and a sex-specific composition exclusive to reproductives. While the effects observed with extracts are not as strong as with live termites, these results open up the intriguing possibility that social signaling may not be limited to cuticular hydrocarbons or other non-polar, volatile chemicals as classically accepted. Our results suggest that polar compounds, in particular some of the Cuticular Protein Compounds (CPCs) shown here by MALDI to be specific to reproductives, could play a significant role in insect societies. While this study is preliminary and further comprehensive molecular characterization is needed to correlate the body-shaking triggering effects with a given set of polar compounds, this exploratory study opens new perspectives for understanding the role of polar compounds such as proteins in caste discrimination, fertility signaling, or interspecific insect communication.

Behavioral experiments. For each treatment, thirty workers were isolated from 10 stock colonies containing individuals of every caste. They were sorted on a CO 2 pad and randomly distributed in 4 types of experimental micro-nests containing 30 workers (Fig. 1): without reproductives (R−), with reproductives (R+), with aqueous polar extracts of workers (WE) or with aqueous polar extracts of reproductives (RE). These four combinations were set up using workers and brachypterous reproductives (i.e., secondary reproductives from nymphal lines 46 ) originating from the same 10 colonies. Micro-nests were made with plastic boxes (50 mm diameter; Star-pack Cat#04913) with a cellulosic ultrapure paper (47 mm diameter; Whatman, grade 42 Ashless) humidified with 250 µl of microfiltered water and 0.5 g of Fontainebleau sand (Carlo Erba Reagents, granulometry: 180-500 µm).
For the R+ treatments, reproductives were sexed on a CO 2 pad and then randomly paired (one male with one female) in the corresponding micro-nests. For R− and R+ treatments, 95 µL of ultrapure water was deposited on the sand of each micro-nest, whereas for extract treatments (WE and RE), 95 µL of polar extracts made from either workers or reproductives originated from the same colonies (i.e., each extract was done on the same colony as the tested workers; see details in the "polar extracts" section) were deposited on the sand. Note that the extracts were deposited to the sand contained in an Eppendorf tube. After the deposit of the extract, the sand samples www.nature.com/scientificreports/ were dropped off into the micro-nest, then the reproductives were added if any, and at the end, the 30 workers were added to the micro-nest (Fig. 1). Video recordings were processed for each micro-nest and treatment for the 10 colonies to assess body-shaking behavior of workers in a room at 26 ± 1 °C with 55 ± 5% of relative humidity. Each micro-nest was transferred to a video recording chamber where individuals could settle down for 5 min before being distributed in groups according to treatments and recorded for 5 min (Panasonic HC-VXF990) under infrared lights. Two series of recordings were done 1 h after the micro-nest setup and 24 h later. Videos were recorded in random order under double-blind conditions 47 . The total number of body-shaking events expressed by all the workers during the 5 min of recordings were analyzed using the BORIS freeware v.6.0.5 48 . Note that a body-shaking event lasts on average 0.7 s, and the quantification of their occurrence was done according to previous publications 43,44 . At the beginning of each video recording, the humidity and temperature of the room were measured. At the end of the experiment, the mortality and sex ratio (number of males over females) of each box were calculated. Polar extracts. Workers and brachypterous reproductives used for polar extracts (WE and RE treatments respectively) were isolated from their stock colony and then sexed on a CO 2 pad. For each colony (n = 10), a pool of workers (5 males and 5 females) was extracted for the WE treatments, and a pool of reproductives (1 male and 1 female) was extracted for the RE treatments. We used this ratio of 5 workers for 1 reproductive to reduce quantitative differences during extractions due to body size differences between workers and reproductives, and so to maximize behavioral responses during experiments. Indeed, based on fresh body mean weight (n = 15 colonies), one reproductive (7.93 ± 0.25 mg) is equivalent to 2.88 workers (2.75 ± 0.15 mg). According to preliminary tests and literature data 38 , ultrapure water was selected as the best polar solvent for the extraction of CPCs. Individuals were washed alive in 100 µL of ultrapure water with a vortexing period of 10 s (LP vortex mixer, 1200 rpm) in a 650 µL plastic Eppendorf. Individuals were incubated at room temperature (20 °C) for 5 min. At the end of the incubation time, an additional 10 s of vortexing was done before transferring the extracts into new vials. Over the 100 µL of polar extracts, 95 µL were used for behavioral experiments and 5 µL were analyzed by spectrophotometry for quality control. Three µL of each extract were measured by spectrophotometry (Varian Cary 50-scan®) at 215 nm (with 3 consecutive readings). Extracts with optical density values lower than the mean value of the water used as control were replaced.

Mass spectrometry analyses.
Compound profiling by MALDI-TOF MS. The protein profile of the cuticular polar extracts was analyzed using MALDI-TOF mass spectrometry for workers and brachypterous reproductives of each sex. The protocol described above in the "polar extracts" section was used to extract compounds, except that extractions were done on single individuals and not on a pool of individuals, per caste and sex for the 15 different colonies. An analysis of protein content in extracts was done using spectrophotometric protein quantification (as described above) and MALDI-TOF MS analysis on 5 individual extracts per sex and caste. These analyses showed that worker extracts present a lower concentration of compounds compared to www.nature.com/scientificreports/ reproductives. To limit detection bias in mass spectrometry by leveling up concentrations between workers and reproductives, for subsequent analyses 10 individual extracts of worker per sex were pooled for each colony (n = 10 pools per sex). Since statistical analysis of the worker samples either from individuals (n = 5 per sex) or from pools (n = 10 per sex) did not present significant differences, both datasets were treated as a single dataset. For reproductives, as mentioned above, analyses were always done individually (n = 15 per sex). The 100 µL of cuticular extracts were stored at − 80 °C until treatment for MALDI-TOF MS analysis. Samples were vacuum concentrated 20-fold using a SpeedVac system (Thermo Scientific). The matrix solution and the MALDI deposition method were chosen to optimize the detection of peptides and proteins. Concentrated samples were diluted 5-fold in a solution of 4-hydroxy-α-cyano-cinnamic acid saturated in a solution of 33.3% acetonitrile, 66.6% water, and 0.1% trifluoroacetic acid. Matrix-sample solutions were spotted onto a gold-plated sample probe using the ultrathin layer method 49,50 . The excess liquid was vacuum-aspirated, and the spots were washed with cold 0.1% TFA to remove contaminants. MALDI-TOF-MS spectra were acquired on an UltraFlex I or an UltrafleXtreme mass spectrometer (Bruker Daltonics) in the linear positive mode in the 1000 to 25000 m/z range. Calibrations were performed externally using apomyoglobin and cytochrome c ion peaks, using the nearneighbor spot method. MALDI-TOF-MS spectra were processed using FlexAnalysis 3.3 software (Bruker). Spectra were smoothed using the gaussian algorithm with a width of 1 Da and 5 cycles. Peaks were annotated using the Sum algorithm with a S/N ≥ 4 and a peak width of 5 Da.
For protein/peptide profile comparisons, spectra were exported from the FlexAnalysis software into the mMass open-source software v5.5.0 (Strohalm et al. 2008) for further treatment. Spectra were averaged when necessary and negative spectra were generated by subtraction from zero.

Identification of proteins/petides by top-down MS/MS sequencing.
To confirm that the profiled extracted cuticular compounds were proteins and/or peptides, top-down MS sequencing of species was performed using MALDI-TOF/TOF LIFT or ESI-QTOF CID fragmentation method. For MALDI-TOF/TOF LIFT, the same sample preparation and deposition method as for the profiling approach were used. For ESI-QTOF CID analysis, samples were diluted in the micromolar range in a solution of 50% acetonitrile, 49.8% water and 0.2% formic acid and were analyzed by direct infusion in a 4-GHz MaXis ultra-high-resolution quadrupole-TOF mass spectrometer (Bruker Daltonics) equipped with an electrospray ion source. Spectra were acquired in positive ion MS mode over a 115-3000 m/z range and were externally calibrated with the ESI-L Low Concentration Tuning Mix (Agilent Technologies). ESI mass spectra were processed using DataAnalysis v3.1 software (Bruker Daltonics). The PEAKS software was used for de novo sequencing of the compounds and spectra interpreted manually. To identify proteins based on the de novo sequence tags, we used the UniProt Peptide search tool to interrogate the TrEMBL and SwissProt Eukaryotes databases. MS/MS data were exploited using a Mascot search, performed from top-down spectra in the two databases: Termitoidae protein database and Termitoidae nucleotide database.
Statistical analyses. The number of body-shaking events counted on each video recording was analyzed using a general linear mixed model (LMM) with several explanatory variables: treatments, time, sex-ratios of workers, mortality, temperature, and humidity. Non-significant variables and interactions were removed following a step-by-step procedure which include sex-ratios, mortality, temperature, and humidity. Note that the average temperature and humidity during video recordings were respectively 25.59 °C ± 0.01 (mean ± SEM) and 57.50%RH ± 0.12. No worker mortality was observed after 1 h, and 3 out of 1200 workers were dead after 24 h. The sex-ratio (male over female) were on average 1.47 ± 0.07. Therefore, the explanatory factors used in the final model were the treatments and the time (with their interactions). To fit with homoscedasticity and normal distribution of model residuals, the numbers of body-shaking were log-transformed. Colonies were included as a random factor in all statistical models. To explore the effects of treatments and time separetly (no-significant interaction were observed), the dataset was split and two additional LMMs were conducted on either 1 h or 24 h with treatments as an explanatory factor. Post-hoc pairwise comparisons were conducted using Tuckey HSD. Time effect comparisons (i.e., 1 h vs 24 h) was analyzed for each treatment separately with non-parametric Wilcoxon signed-rank tests. Outliers were calculated with the default interquartile range criterion of 1.5.
For each caste and sex sample, MALDI-TOF mass spectra were treated in FlexAnalysis as described above to calculate peak areas. Peaks with a standard deviation below 1 Da were assigned to the same compound up to m/z 9800. For m/z > 9800, a standard deviation less than 1.7 Da was used as threshold due to the lower resolution and intensity of the peaks. Compounds that were observed only once among the samples were not considered and deleted from the mass list. Although some peaks may correspond to doubly charge species, all m/z values were treated as separate species in statistical analyses. Then peak areas were analyzed using two multivariate analyses. Centered and autoscaled data were tested with and without quadratic transformation. Since no improvement was shown with transformation, further analyses were done without 51 . To discriminate groups based on their protein profiles, Partial Least Squares Discriminant Analyses (PLS-DA) were done on peak areas with the plsda function (mixOmixs package) on 4 pre-defined groups (female reproductives, male reproductives, female workers, and male workers). Then VIP values were calculated to determine the importance of each CPC in the projection. The most representative CPCs with a VIP value greater than 1 were thereafter noted as major compounds 52 . Comparisons between sexes within each caste were done on the relative peak areas (calculated by dividing a peak area by the total area of all the peaks in the spectrum) with non-parametric Wilcoxon signed-rank tests.
At 24 h after the setup of the micro-nests (Fig. 2B), the effect of the treatments was still significant (F (3,22.8) = 6.65, p < 0.001). However, post-hoc tests showed that only the presence of live reproductives (R+: 93.7 ± 8.5) was significantly different from the other treatments (R+ vs R−: 31.6 ± 8.6, t (27)  According to VIP values, there are 87 major CPCs ranging from m/z 1000 to 25000 (Table 1). Profile comparisons of the 87 major CPCs of the 4 groups (RF, RM, WF, WM) revealed that 85 CPCs were specific to reproductives and 2 CPCs were specific to workers. Among all 7 CPCs detected in workers (i.e., CPCs specific to workers or CPCs shared with reproductives), 5 were common to females and males, and 2 were specific to females. Among the 85 CPCs detected in the reproductives, 48 were common to female and male reproductives, 19 were female-specific, and 18 male-specific. The mean relative areas of the major compounds common to both sexes were compared within each caste. These comparisons revealed that three compounds present higher mean relative areas in female reproductives compared to male reproductives: CPC-4963 (W = 153, p = 0.0252), CPC-10196 (W = 156, p = 0.022), and CPC-10416 (W = 166, p = 0.013). No significant sexe differences where found for workers.
Over the first quartile of the 87 majors CPCs (Fig. 6), CPC-4139 presents the highest mean relative area in reproductives, whereas CPC-1330 is the highest peak in workers (see the full list in Table 1). Interestingly, two CPCs are specific to female reproductives (CPC-4284 and CPC-9926) and four CPCs are specific to male reproductives (CPC-223, CPC-2229, CPC-2279, CPC-4969). A visual comparison of the compounds repartition across castes and sexes of the first-quatile of the 87 major CPCs is shown in the supplementary Fig. 1 (CPCs were attributed to a group if more than one individual present the compound). Overall, in R. flavipes the diversity in workers' proteins is very low compared to reproductives (Sup. Figure 2).

Identification of protein/peptide components by top-down MS sequencing.
To confirm that the profiled cuticular extracts correspond to proteins and/or peptides, we performed mass spectrometry fragmentation on CPC-4428 and CPC-10056. Clear fragmentation spectra were produced for these two VIP CPCs. Manual interpretation of MS/MS spectra showed that inter-peak distances correspond to amino acids, as shown for CPC-10056 (Fig. 7).
Further interpretation of the spectra allowed for the determination of sequence tags for these two CPCs ( Table 2). Search with the UniProt Peptide search tool in the TrEMBL and SwissProt protein databases for the sequence tag from CPC-10056 (I/LI/LDDVI/LAVI/L) generated seven hits in Eukaryotes (Table 3). Within the Insecta taxon, the protein G12 precursor from Culex quinquefasciatus (southern house mosquito) was found under the B0WED5 Uniprot entry, while two uncharacterized proteins from Coptotermes formosanus (Formosan subterranean termite) were found under the A0A6L2P7Q1 and A0A6L2PEY0 entries.   Table 1 for the entire list of compounds. www.nature.com/scientificreports/ Coptotermes formosanus in the Termitoidae cDNA library, with a score of 81 (Table 4). This hit contains the manually annotated de novo tag I/LI/LDDVI/LAVI/L from CPC-10056. BLAST alignements on the sequence of the G12 precursor from Culex quinquefasciatus and the JK369675.1 sequence from Coptotermes formosanus showed similarities varying from ~ 39% to ~ 72% with protein G12 from termite species Cryptotermes secundus and Zootermopsis nevadensis. The protein G12 precursor from Culex quinquefasciatus, together with its homolog AEG12 induced in the midgut of adult female mosquitoes (Aedes aegypti) after a blood meal, is thought to have originated from a common ancestor and share a close similarity to a digestion-related cockroach allergen proteins also secreted in the midgut 53,54 . Altogether, these results suggest that CPC-10056 is an ortholog of G12 protein, which is present in cuticular polar extracts and specific to reproductives in our studied species R. flavipes.

Discussion
In this preliminary work, we explore whether there is a potential basis for polar compounds, in particular proteins, to act on reproductive recognition in termites. For this purpose, we used aqueous extraction of polar compounds for both protein profiling and for behavioral assays. It has been previously demonstrated that bodyshaking of workers (a vibratory behavior) was involved in reproductives recognition in the termite R. flavipes, since workers expressed this vibratory behavior in presence of reproductives or eggs 18,43,44 . Here we show that cuticular polar extracts from reproductives contain compounds that have the ability to trigger body-shaking, while extracts from workers do not. This effect is however lower than that observed with live termites. We also show that body-shaking behavior varies over time, with an increase in the number of events after 24 h, except for reproductive extracts. Using a MALDI-TOF MS method designed for protein detection, we have profiled proteins in the polar extracts of different castes and sexes, and obtained by ESI-MS/MS sequence tags on a sample of differentially observed peaks that confirm that the profiled compounds are essentially proteins. Henceforth, we have referred to the compounds detected in polar extracts using this method as Cuticular Protein Compounds (CPCs). Statistical analyses reveal a diverse mixture of CPCs, which is caste-and sex-dependent. Mass spectrometry  www.nature.com/scientificreports/ fragmentation showed at least one example of an ortholog of G12, a digestive protein in termites which could be related to regurgitation by the insect. CPCs composition was more diverse in reproductives (171 CPCs in total) than workers (41 CPCs), with sex-specific CPCs for both male and female reproductives. The presence of a molecular signature of CPCs specific for a sex and/or caste raises the intriguing possibility that proteins could encode for social signals, as was shown for cuticular hydrocarbons (CHCs) or volatile compounds, which are well known to be involved in several social interactions [18][19][20][21][22][23][24] . Chemical profiles with sex-or caste-specific compounds are well-known in numerous insect species, mainly supported in social insects by the cuticular hydrocarbons (CHCs) 25,55 . Indeed, CHCs are the most studied chemical cues since decades, and represent a key element of the division of labor in social insects. CHCs constitute a major part of the lipids covering the cuticle of insects, both in relative and absolute abundance 56 . The CHC composition vary between sexes and castes which has been shown to be a central factor involved in social interactions, social organization, recognition, sex/fertility signaling and mate choice, as well as in reproductive regulation in insects 22,25,[56][57][58] , and more specifically in termites [21][22][23]42 . However, CHCs are not the only chemical components of insects' cuticle 59 . While cuticular polar compounds diversity could encode information in addition to the well-known CHCs, their implication in social interactions has been largely neglected until now. The aqueous extracts prepared for the present study are not expected to contain non-polar compounds. However, we cannot rule out that at least part of the observed effect is due to polar compounds which are not proteins. Still, among polar compounds, proteins have physico-chemical characteristics that make them good candidates to hold nonvolatile information, independently of or in concert with CHCs or non-polar compounds. Moreover, proteins are known to evolve to carry specific functions in organisms, and would be good candidates as specific mediators, whereas polysaccharides in the cuticle are likely to play mostly a structural role, with few variations between castes and sexes. In vertebrates and aquatic species, proteins are well-known to be major mediators for interindividual interactions 3 . In insect societies, with structured nests, we could hypothesize that large biomolecules like CPCs could be deposited at nest's hubs (royal chamber or nest entrance), allowing the transfer of contact www.nature.com/scientificreports/ information to walking nestmates, for example through gustatory perception. This is especially true for social insects living in close environments where contacts between individuals are numerous. Polar compounds such as CPCs could also be exchanged during social interactions like grooming or food transfer (social trophallaxis). Globally in insects, investigations on proteins as signals are scarce and do not concern cuticular proteins 3 . Only few examples of proteins were described on the cuticle in bees, wasps and termites but no functional study was done [36][37][38] . In Drosophila melanogaster, the proteins contained in the seminal fluid are transferred to the female during mating, and they trigger various physiological and behavioral post-mating modifications of the female 60 . In honeybees, the proteins deposited by nurses in the royal jelly induce the development of larvae into future queens 61 . To our knowledge, the only studies investigating the behavioral role of proteins in termites showed the presence on the egg surface of an antibacterial lysozyme and a digestive β-glucosidase enzyme 62,63 . This protein complex called TERP (Termite Egg Recognition Pheromone) allow workers to recognize eggs. Here, we show evidence that CPCs could be as diverse and specific as CHCs. Indeed, the diversity and richness of reproductives' CPCs, and the behavioral response of workers exposed to polar extracts from reproductives, seems to point to their involvement in reproductive recognition. Whether CPCs are cues (i.e., unselected byproducts of the metabolism) or signals (evolved for the benefit to both the sender and the receiver) remain to be determined 2 . Further work is needed to show if a set of CPCs induces behavioral answer in the subterranean termite R. flavipes, and to identify these proteins.  Table 1 for CPCs numbers). * indicates significant differences (Wilcoxon signed-rank test, p < 0.05; Multiple comparisons were corrected with FDR as the p-value adjustement method).     www.nature.com/scientificreports/ Reproductive signaling (or fertility signaling) in eusocial insects is one of the essential mechanism involved in the maintenance of the reproductive division of labor and thus it represents a major functional key of insect societies 17 . We observed for the first time that cuticular polar compounds of reproductives trigger body-shaking behavior in workers. Workers display more body-shaking in presence of live reproductives and extracts of reproductives than in presence of workers' extracts. In the termite R. flavipes, this behavior is a proximate of the presence of reproductives 18,43,44 (also shown in this study). Thus, we bring behavioral evidence that reproductives recognition is mediated by cuticular polar compounds, with proteins and peptides good candidates as active compounds. Interestingly, the hydrocarbon heneicosane, a non-polar compound, has also been identified on the cuticle of male and female reproductives of R. flavipes, inducing body-shaking and antennation of workers 18 . However, some reproductives and some populations of R. flavipes do not produce heneicosane [64][65][66] . Therefore, more studies are needed to disentangle the potential complementary roles of heneicosane and polar compounds such as proteins on reproductives recognition. The presence of non-polar compounds is one of a few non-mutually exclusive explanations for the difference in effect between live reproductives and extracts we observed. Other chemical/physical cues, yet to be discovered could also be involved, which again could explain why live reproductives induce more body-shaking events than their cuticular polar extracts. For example, the cuticle structure itself could act as a carrier matrix with potentiation of the effect. Several other phenomena, alone or in concert, could account for the lower response elicited by a sand/cellulose paper support. First, proteins can adsorb on surfaces, and such adsorption on silica and cellulose material tends to be irreversible, which would decrease the apparent concentration, and thus the availability of the compounds for detection by workers. Second, some proteins may act synergistically 42 , so that an alteration of the ratio of some compounds might decrease their combined effect on workers. Taking all these considerations into account, future experiments aimed at identifying active proteins in behavioral assays should include the use of termite dummies, preferably foregoing the use of glass which bears the greatest adsorption risk. It should be noted that protein profiling was done on cuticular extracts directly, so that they more directly reflect the actual set of compounds present on the cuticle than extracts deposited on supports. Non-volatile compounds such as CPCs imply direct contact between the receptors and the substrate. When workers are exposed to a live reproductive or to an extract, in our setup it is not easy to determine if the workers exhibit body-shaking behavior immediately after contact, or if there is a delay in the behavior. The behavior could also be expressed after a short time-lapse and is not necessarily expressed directly during the contact event. Moreover, the receptor could be localized on the antennae or other body parts like the legs. It will be interesting to determine which receptors are involved in this behavior and what are their localization.
Time dependent comparisons showed an increase of body-shaking events at 24 h compared to 1 h in all treatments, except for reproductives' extracts where it was only marginally significant (Z = 8.5, p = 0.059; Fig. 3). This general increase, for both the mean values and also the data variances, may reflect a behavioral instability due to social isolation. Indeed, tested micro-nests are fragments of larger mother colonies displaying rich interactions. Therefore, workers might be more behaviorally disturbed after 24 h of isolation than after 1 h. At 24 h (Fig. 2B), the presence of reproductives still induce body-shaking but the reproductives extracts do not anymore. Alternatively, the short-lived effect of the extracts might be due to a lack of stability of the polar compounds over time. For proteins, this is a credible explanation as they would be susceptible to proteolysis by regurgitated gastric enzymes. On the other hand, some compounds may be long-lived: Turillazzi et al. reported long-lasting peptides on the cuticle and in the venom of Polistes: dominulin A (1854 Da) and dominulin B (1909 Da) 40 . It is possible that CPCs must be deposited repeatedly on substrates to maintain their effect, or that the cuticle is needed as a stabilizing carrier. Since extracts and focused workers were set up at once at the beginning of the experiments, we could not test the "stability over time of the extracts" separately from the "isolation time effect on workers", preventing conclusion on these hypotheses. It also would be interesting to determine the speed at which proteins/peptides in the extracts degrade or lose their structural integrities to the point of not be able to carry information anymore.
Cuticular polar extracts from all caste and sex are composed of a diverse mixture of proteins and peptides up to the 25 kDa range according to our mass spectrometry analyses. CPCs from reproductives are more diverse than those from workers' cuticle, with CPC-1330, CPC-4139 and CPC-4998 as main compounds separating both castes. In the paper wasp P. dominulus, foundress and their first emerged workers present clear specific CPCs (918 to 2679 Da) 37 . This caste specificity of CPCs was also observed in 3 termites species including our studied species R. flavipes 38 (note that R. santonensis is a synonym of R. flavipes 67 ). However, the compounds observed in female reproductives seems to be different from compounds in our study, except for the compound at m/z of 4141, which may correspond to CPC-4139 (m/z 4138.6). CPCs are also sex-specific for reproductives and workers in our study. Sex-specific CPCs are less common in the literature, but they have been described in a species phylogenetically close to termites, the cockroach Leucophaea maderae. In this species, males present two specific cuticular proteins (LMA-P18 and LMA-P22) progressively accumulated from ecdysis to sexual maturity 35 . The protein LMA-P22 is supposed to be involved in sexual behavior and belong to the lipocalin family known to bind hydrophobic compounds like pheromones 68 . In our study, among the 87 major CPCs we identified in reproductives 19 CPCs specific to females and 18 to males, including 3 CPCs more abundant in females than males, suggesting a function of CPCs in sex signaling and caste regulation. Indeed, the production of reproductives in R. flavipes is under a dual sex-specific regulation 22 . While the presence of female reproductives inhibits the production of new female reproductives, it also stimulates the molting of male workers into new male reproductives. A reciprocal phenomenon is observed for male reproductives. In workers, only females present sex-specific CPCs with 2 compounds (CPC-4139 and CPC-4158). Interestingly, CPC-4139 is the most abundant CPC in reproductives, whereas in workers, it is found in very low abundance, in only 2 samples out of 30. This low abundance and frequency may reflect the presence of female workers starting to develop into reproductives. Removing those individuals from the dataset did not change the results on group separations and www.nature.com/scientificreports/ on the identification of caste/sex specific CPCs. We also noticed that a few reproductives (particularly males) present CPC profiles close to workers (Fig. 5), suggesting that these males may be new immature reproductives. It would be interesting to monitor the maturation of the CPC profiles for individuals engaged into reproductive differentiation, such as workers and nymphs, during the different molting phases, as well as to track body-shaking behaviors according to the sex of the reproductives. Overall, these results are consistent with the possible role of CPCs in reproductive recognition, sex signaling and caste regulation. In insects, secreted and cuticular proteins/peptides have been mainly identified to possess antimicrobial activities 37,62,69 . Mass spectrometry proteomics explorations allowed us to confirm that polar extracts contain proteins and may bear post-translational modifications such as glycosylations. Our preliminary fragmentation study shows that CPC-10056 is an ortholog of the midgut enzyme protein G12, also identified in several termite species. Interestingly, at least one digestive enzyme is known to be involved in the reproductive division of labor in the termite C. secundus 70 : the Neofem2 gene encodes a wood digestion enzyme (β-glucosidase). Indeed, silencing Neofem2 in queens induce fights between workers for the royal succession in queenless colonies. Moreover, this β-glucosidase is involved in egg recognition in the termite R. speratus 63 , and in the mating behavior of the cockroach Leucophaea maderae 71 . Altogether, these observations suggest that the function of the β-glucosidase may have evolved from wood digestion to pheromonal caste regulation in cockroaches and termites, even though other chemicals could contribute to these pheromonal functions 18,63,70,71 .
Future studies will be needed to sequence and identify reproductive-specific CPCs and to test their behavioral functions, either individually or in synergy, to determine if they play a role in the regulation of social interactions in insects, as observed in vertebrates or aquatic organisms 3 . Sex-specific variations observed in reproductives also open new fields of inquiry for the potential role of proteins in the dual sex-specific regulation of the reproductives which is occurring in subterranean termites 22,72 . The scope of our study is limited by the fact that sand-deposited polar extracts may be depleted in some proteins and other polar molecules that may act as synergistic compounds. Despite these limitations, the combination of protein profiling with behavioral bioassays represents a powerful tool to explore unknown aspects of insect communications, which when further correlated through the identification of the active proteins, could help to answer unresolved questions about social insect communication.

Data availability
The datasets generated during the current study are available in the Zenodo repository (https:// zenodo. org/ record/ 73195 84).